Augmenting Presentation MathML for Search
نویسندگان
چکیده
The ubiquity of text search is both a boon and bane for the quest for math search. A bane in that user’s expectations are high regarding accuracy, in-context highlighting and similar features. Yet also a boon with the availability of highly evolved search engine libraries; Youssef has previously shown how an appropriate ‘textualization’ of mathematics into an indexable form allows standard text search engines to be applied. Furthermore, given sufficiently semantic source forms for the math, such as LTEX or Content MathML, the indexed form can be enhanced by co-locating synonyms, aliases and other metadata, thus increasing the accuracy and richness of expression. Unfortunately, Content MathML is not always available, and the conversion from LTEX to Presentation MathML (pMML) is too complex to carry out on the fly. Thus, one loses the ability to provide query-specific, fine-grained highlighting within the pMML displayed in search results to the user. Where semantic information is available, however, such as for pMML generated from a richer representation, we propose augmenting the generated pMML with those semantics from which synonyms and other metadata can be reintroduced. Thus, in this paper, we aim to have both the high accuracy introduced by semantics while still obtaining fine-grained highlighting.
منابع مشابه
Augmenting Mathematical Formulae for More Effective Querying & Presentation
1 Summary Scientists and engineers search regularly for well‐ established mathematical concepts, expressed by mathematical formulae. Conventional search en‐ gines focus on keyword based text search today. An analogue approach does not work for mathe‐ matical formulae. Knowledge about identifiers alone is not sufficient to derive the semantics of the formula they occur in. Currently, for formula...
متن کاملA Family of Modular XML Schemas for MathML
MathML is a complex XML application that can, in fact, benefit from a schema definition. One problem in defining such a schema is to develop an architecture that captures the logical structure of MathML. The MathML definition provides two sorts of markup, presentation markup which captures the notational aspects of mathematics, and content markup which captures the meaning of mathematical expre...
متن کاملIndexing and Searching Mathematics in Digital Libraries
This paper surveys approaches and systems for searching mathematical formulae in mathematical corpora and on the web. The design and architecture of our MIaS (Math Indexer and Searcher) system is presented, and our design decisions are discussed in detail. An approach based on Presentation MathML using a similarity of math subformulae is suggested and verified by implementing it as a math-aware...
متن کاملThe Utility of OpenMath
OpenMath [5] is a standard for representing the semantics of mathematical objects. It differs from ‘Presentation’ MathML [7] in not being directly concerned with the presentation of the object, and from ‘Content’ MathML in being extensible. How should these extensions be performed so as to maximise the utility (which includes presentation) of OpenMath?
متن کاملMathML-OpenMath Interface for REDUCE
Description OpenMath and MathML are two ways of representing mathematical objects. Semantically, OpenMath is a superset of (content) MathML. The aim is to build a translator from OpenMath to content MathML, using presentation MathML where necessary as in the example of rank in Section 5.3 on MathML OpenMath is extensible, the translator will need to be. There is no a priori choice of implementa...
متن کامل